STREPTOLYSIN O DERIVATIVES




A portion of the disclosure of this patent
document contains material which is subject to copyright
protection. The copyright owner does not object to the
reproduction by anyone of the patent document or the
patent disclosure, as it appears in the Patent and
Trademark Office patent file or records, but otherwise
reserves all copyright rights whatsoever.

RELATED APPLICATIONS

This application is related to United States
Serial No. (Beckman Docket No. 128D-122)
entitled "Streptolysin O Variants" by Craig W. Adams, and
United States Serial No. (Beckman Docket No.
128D-123) entitled "Antibodies to Streptolysin O
Derivatives and Variants", by Craig W. Adams and Patty
Pang. Both applications are being filed simultaneously
herewith, and both are incorporated herein by reference.



FIELD OF THE INVENTION 



The present invention is generally related to
Streptolysin O and more particularly to Streptolysin O
derivatives produced by recombinant DNA technology.

BACKGROUND OF THE INVENTION

Disclosed herein is a derivative fusion
product of the antigenic substance, Streptolysin O.
Streptolysin O is associated in humans with, for example,
rheumatic fever, such that immunodiagnostic assays for
evidence of immunological response against Streptolysin O
are routinely utilized. The disclosed derivative version
of Streptolysin O is produced by recombinant DNA
techniques, is soluble upon expression, is capable of
being bound by at least one antibody capable of binding
to wild-type SLO, and is hemolytically active. Prior to
this invention, Streptolysin O could be obtained via the
bacteria Streptococcus pyogenes. The toxic and
pathogenic properties of Streptolysin O are typically
monitored by the lysis of red blood cells.

I. The Genetic Code

The genetic code for a particular protein,
such as Streptolysin O (hereinafter "SLO"), depends upon
the sequential grouping of three nucleotides, referred to
as a "codon," and the arrangement of such codons in
relationship to each other.

A "nucleotide" consists of a nucleoside and
one or more phosphate groups. A "nucleoside" consists of
a nitrogenous base linked to a pentose sugar. A
"pentose" sugar comprises five carbon atoms. In a
molecule of deoxyribonucleic acid, or "DNA", the pentose
sugar is "deoxyribose," and the nitrogenous base can be
adenine ("A"), guanine ("G"), thymine ("T") or cytosine
("C"). In a molecule of ribonucleic acid, or "RNA", the
pentose sugar is "ribose", and the nitrogenous bases are
the same as for DNA, except that uracil ("U") replaces
thymine. Three types of RNA, messenger RNA, or "mRNA",
transfer RNA, or "tRNA", and ribosomal RNA, or "rRNA",
translate the genetic information encoded in the DNA
into, e.g., a polypeptide or a protein. Thus, genetic
information is generally transferred as follows:
DNA → RNA → protein.

The sequence of the nitrogenous bases of the
DNA molecule encodes the genetic information contained in
that molecule. The sugar and phosphate groups of the DNA
molecule perform a structural role, forming the backbone
of a series of DNA molecules, referred to as a DNA
"macromolecule." DNA consists of two complementary
strands of nucleotide chains, and these strands are held
together by (relatively) weak hydrogen bonds. The bases
of the two strands bind to each other: A always bonds
with T and C always bonds with G. Thus, the sequence
5'-ATCG-3' of a first strand lies immediately opposite a
complementary sequence 3'-TAGC-5' on the other strand.
This is referred to as "complementary base pairing." The
process of complementary base pairing is referred to as
"hybridization" and results in the formation of a stable
DNA macromolecule.
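
The pairing rule can be illustrated with a short Python
sketch. It is only an illustration: the function names are
hypothetical, and the code assumes an unambiguous
upper-case sequence of A, C, G and T. It shows the bases
that lie directly opposite a strand, and the same partner
strand written in the conventional 5'-to-3' direction.

```python
# Watson-Crick pairing as stated in the text: A pairs with T, C pairs with G.
PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(strand: str) -> str:
    """Bases lying directly opposite `strand`, in the same left-to-right order."""
    return "".join(PAIR[base] for base in strand)


def reverse_complement(strand: str) -> str:
    """The partner strand rewritten in its own 5'-to-3' direction."""
    return complement(strand)[::-1]


# Opposite 5'-ATCG-3' lie the bases T, A, G, C (read 3' to 5').
opposite = complement("ATCG")
# The same partner strand, read 5' to 3'.
partner_5_to_3 = reverse_complement("ATCG")
print(opposite, partner_5_to_3)
```

Because the two strands run in opposite directions, the
partner of 5'-ATCG-3' reads TAGC from its 3' end and CGAT
from its 5' end.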

Each codon specifies one amino acid. "Amino 
acids" are the principal components of proteins, and 
"proteins" are the essential constituents of all living 
cells. There are 20 natural amino acids. Because there 
are four nucleotide bases (A, C, G and T) and three 
nucleotides per codon, there are 64 possible codons (4^3).
Accordingly, because there are only 20 natural amino 
acids, most amino acids are specified by more than one 
codon. This is referred to as "redundancy" or 
"degeneracy". For example, the codons GCG, GCA, GCT and 
GCC all encode for the amino acid Alanine. 
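
The codon count can be checked by simple enumeration. The
Python sketch below is illustrative only; the variable
names are hypothetical. It lists every three-base
combination of A, C, G and T, removes the three stop
codons TAA, TAG and TGA, and picks out the four alanine
codons, which share the prefix GC.

```python
from itertools import product

BASES = "ACGT"
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Every ordered triple of the four bases: 4 * 4 * 4 combinations.
all_codons = ["".join(triple) for triple in product(BASES, repeat=3)]

# Codons left over to specify the 20 natural amino acids.
sense_codons = [codon for codon in all_codons if codon not in STOP_CODONS]

# The four codons named in the text for alanine differ only in the third base.
alanine_codons = sorted(codon for codon in all_codons if codon.startswith("GC"))

print(len(all_codons), len(sense_codons), alanine_codons)
```

With more sense codons than amino acids, several codons
must map to the same amino acid, which is the degeneracy
described above.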

The codon ATG (Met amino acid codon) is the
normal "start" codon. The codons TAA, TAG and TGA, which
do not encode amino acids, are the normal "stop"
codons. The formation of mRNA is established based upon
the start codon of one strand of the double-stranded DNA
macromolecule such that the resulting single-stranded
mRNA will have a nucleotide sequence complementary to the
sequence of a single strand of the DNA. When a stop
codon is reached by the mRNA along the DNA molecule,
translation is stopped.
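
The role of the start and stop codons can be sketched as a
simple scan. The Python function below is a hypothetical
helper, not a method from this disclosure: it looks for
the first ATG in a coding-strand sequence and reads
successive codons in that frame until it meets TAA, TAG
or TGA.

```python
from typing import Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_open_reading_frame(dna: str) -> Optional[str]:
    """Return the stretch from the first ATG through the first in-frame stop codon.

    Returns None when there is no ATG, or when no stop codon is found in frame.
    """
    start = dna.find("ATG")
    if start == -1:
        return None
    # Step through the sequence three bases at a time, staying in frame.
    for i in range(start, len(dna) - 2, 3):
        if dna[i:i + 3] in STOP_CODONS:
            return dna[start:i + 3]
    return None


print(first_open_reading_frame("CCATGGCGGCATAAGG"))
```

In the example sequence the frame reads ATG, GCG, GCA and
then the stop codon TAA, so the bases before ATG and after
TAA are excluded.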

The regions along the DNA macromolecule which
are translated from the mRNA are referred to as "exons"
for eukaryotes, and "translated regions" for prokaryotes.
"Genes" include exons (eukaryotes) and translated regions
(prokaryotes). Thus, genes encode proteins and/or
polypeptides. Mammals, for example, are eukaryotes;
bacteria, for example, are prokaryotes.

The natural synthesis of protein takes place
over a series of several steps. The first step is the
formation of an mRNA macromolecule complementary to the
DNA macromolecule, as noted above. Thereafter, tRNA is
manufactured; the tRNA provides a complementary codon
("anti-codon") for each codon on the mRNA macromolecule.
Thereafter, rRNA will catalyze the assembly of the
codon-specific amino acids resulting from the mRNA:tRNA
into proteins and/or polypeptides.

II. Recombinant DNA Technology 

Most proteins are produced naturally in
extremely small quantities. The advent of recombinant
DNA technology has allowed for the production of large
quantities of proteins that were previously only
available in such small quantities.

The following describes a "typical" genetic
manipulation as it might apply to Escherichia coli, a
typical bacterial host used for cloning.

In order to isolate, or "clone", a gene, a DNA
library is constructed from a DNA sequence (referred to
as a "genome") using vectors. A "vector" is a small
circular molecule of double-stranded DNA that occurs
naturally in bacteria, yeast and mammalian cells.
Vectors generally comprise the following characteristics:
(i) a DNA sequence encoding a selectable "marker" which
assures that the vector will be maintained in an
appropriate host cell (e.g., E. coli); (ii) a
controllable transcriptional promoter (by
"controllable" is meant that the promoter can be
"switched on" by manipulation of, e.g., the environment of
the vector; a "promoter" is a region of DNA sequence that
when switched on produces large amounts of mRNA from the
gene of interest inserted into the vector; different
promoters, e.g., lac, trp, tac, etc., have different
rates of mRNA production); (iii) translational control
sequences, for example, an appropriately positioned ATG
start codon; and (iv) a polylinker; a "polylinker"
simplifies the insertion of the gene of interest in the
correct orientation within the vector. Vectors can be
engineered to provide restriction endonuclease sites on
either side of an ATG start codon located on the vector
such that the gene of interest can be inserted next to
the start codon; this allows for immediate transcription
of the gene upon activation of the promoter gene.



A "restriction endonuclease" is an enzyme
which cuts the double-stranded DNA at specified sequences
of four to eight nucleotides in length, and many
restriction endonucleases produce staggered cuts that
leave a short, single-stranded tail at the location of
the cut. This end is referred to as a "cohesive" or
"sticky" end because it can form complementary base pairs
with another sticky end. The genome is cleaved (cut up)
by a specified restriction endonuclease corresponding to
the restriction endonuclease used to cut the vector, and
the individual pieces of the cleaved genome are inserted
into the vector. Randomly cleaving the entire genome of
a cell with a specific restriction endonuclease is
typically referred to as the "shotgun" approach to gene
cloning. The shotgun approach can produce an extremely
large number of DNA fragments, all of which are inserted
into vectors.



The individual pieces of the genome and the
vectors, having corresponding sticky ends, are "fused" or
"annealed" together to form circular hybrid DNA
"plasmids" comprising a portion of the genome and the
vector.
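
As a rough illustration of why fragments and a vector cut
with corresponding enzymes can anneal, the Python sketch
below computes the single-stranded tail left by Bgl II and
Bam HI, the two enzymes used in the Examples. The
recognition sequences and cut positions are the commonly
published values for these enzymes and are an assumption
here, not data from this disclosure; the helper names are
hypothetical.

```python
# enzyme -> (recognition site written 5'->3', index at which the top strand is cut)
# Assumed, commonly published values: Bgl II cuts A^GATCT, Bam HI cuts G^GATCC.
ENZYMES = {
    "BglII": ("AGATCT", 1),
    "BamHI": ("GGATCC", 1),
}


def overhang(enzyme: str) -> str:
    """Single-stranded 5' tail left by a staggered cut in a palindromic site."""
    site, cut = ENZYMES[enzyme]
    return site[cut:len(site) - cut]


def digest(dna: str, enzyme: str) -> list:
    """Cut the top strand at every occurrence of the enzyme's recognition site."""
    site, cut = ENZYMES[enzyme]
    fragments, start = [], 0
    pos = dna.find(site)
    while pos != -1:
        fragments.append(dna[start:pos + cut])
        start = pos + cut
        pos = dna.find(site, pos + 1)
    fragments.append(dna[start:])
    return fragments


def compatible(enzyme_a: str, enzyme_b: str) -> bool:
    """For palindromic tails, identical overhangs can base-pair with each other."""
    return overhang(enzyme_a) == overhang(enzyme_b)


print(overhang("BglII"), overhang("BamHI"), compatible("BglII", "BamHI"))
print(digest("TTAGATCTCCGGATCCAA", "BglII"))
```

Under these assumptions both enzymes leave the same
four-base tail, so a fragment produced by one can anneal
to a vector opened with the other.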



The plasmids are then introduced into host
cells. There are two types of host cells, "eukaryotic"
and "prokaryotic". An example of a eukaryotic host cell
is the Chinese hamster ovary ("CHO") cell; an example of a
prokaryotic host cell is E. coli bacteria. For purposes
of the discussion to follow, attention will focus on
prokaryotic host cells.

When the plasmids are introduced into the host
cell, these cells are referred to as being "transformed"
with the plasmids. As the cells grow and divide, the
plasmids will similarly replicate to produce copies of
the plasmids containing the DNA fragments. Each
transformed cell is referred to as a "genomic DNA clone"
and the entire collection of transformed cells containing
all of the different DNA fragments is referred to as a
"genomic DNA library".



In order to determine which genomic DNA clones
contain the DNA sequence capable of being copied into a
corresponding mRNA, it is necessary to separate or
"screen" the genomic DNA clones. There are several ways
to accomplish this task including, for example, the use
of radioactive DNA probes or evidence of
immunoreactivity. Screening can be an extremely
labor-intensive process because, as noted, the shotgun
approach by definition leads to the formation of an
extensive number of genomic DNA clones, which must be
screened to find potential candidates of interest.



III. Streptolysin O 



Streptolysin O ("SLO") has an approximate
molecular weight of between about 65,000 and about 70,000
daltons. SLO belongs to a class of oxygen-sensitive
("thiol-activated"), cell-destroying ("cytolytic") toxins
("cytotoxins") which are produced by gram-positive
bacterial species belonging to four different genera
(Streptococcus, Bacillus, Clostridium and Listeria).

SLO interacts with membrane cholesterol and
exerts cytolytic-cytotoxic effects on a broad range of
mammalian cells. Additionally, SLO has very potent
cardiotoxic properties. One of the toxic and pathogenic
properties associated with SLO is its hemolytic activity,
i.e., SLO will lyse red blood cells, resulting in the
release of hemoglobin. SLO can be lethal to laboratory
animals in relatively small doses. Injection of SLO
into an animal typically results in its immediate death.

Because SLO is produced by specified bacterial
species, when these species "invade" a mammalian host,
the SLO released by the bacteria is treated by the host
as a foreign protein. SLO, then, is an antigen.
"Antigens" are high molecular weight compounds which upon
entry into the blood stream of a vertebrate stimulate the
transformation of the small lymphocytes of the B-type
into lymphoblasts. The lymphoblasts secrete antibodies
specific to the antigen stimulator. The antibodies are
proteins possessing reactive sites specifically
complementary to a reactive feature or site on the
stimulating antigen. Antibodies generally have the
property of rendering the antigen harmless to the host
organism by occupying the immunologically active sites,
or "epitopes", on the antigen particles or molecules.
Anti-SLO antibodies ("ASO") are therefore produced by the
host in response to the secretion of SLO into the host.
Approximately 80-85% of individuals with current
streptococcal infection or its sequelae (an after
effect of a disease or injury) will demonstrate elevated
levels of ASO.

Determination of previous and/or current
infection by the specified bacterial species which
secretes SLO is possible using immunodiagnostic assaying
techniques which, e.g., rely upon the hemolytic
properties of SLO and the binding of ASO to SLO. Focusing on
hemolytic immunodiagnostic assays for SLO, a patient
sample is added to a known amount of SLO derived from a
source other than the patient and this mixture is added
to a known amount of red blood cells such as, for
example, rabbit red blood cells. Because SLO has
hemolytic properties, it will lyse these red blood cells.
However, when ASO binds to SLO, the hemolytic properties
of SLO are neutralized. Thus, if the sample is obtained
from a patient having current streptococcal infection or
its sequelae, there will be elevated levels of ASO in
the sample. Accordingly, if the mixture results in high
levels of hemolytic activity, this indicates that there
is little, if any, ASO in the serum sample (and hence
little, if any, infection from the SLO-secreting
bacteria) because the known quantity of SLO in the
mixture is capable of lysing the known quantity of red
blood cells in the mixture. If the mixture does not lead
to hemolytic activity, this is indicative of an amount of
ASO in the sample sufficient to inactivate the known
quantity of SLO in the mixture. Investigators refer to
such an amount of ASO as a "titer". Typically, an ASO
titer of greater than about 300 International Units/ml is
indicative of infection by a bacterial source capable of
secreting SLO. Other immunodiagnostic assays for
determination of infection by SLO-secreting bacteria
include nephelometric and turbidimetric protocols.
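
The read-out logic of the hemolytic assay can be
summarized in a small Python sketch. This is an
illustration of the reasoning only, not a clinical
procedure: the function names are hypothetical, and the
"about 300" figure from the text is treated as a hard
cutoff purely for simplicity.

```python
# "About 300 International Units/ml" in the text; a hard cutoff is an assumption.
ASO_TITER_CUTOFF_IU_PER_ML = 300.0


def interpret_hemolysis(hemolysis_observed: bool) -> str:
    """Map the visible assay outcome to the inference drawn in the text."""
    if hemolysis_observed:
        # The added SLO stayed active, so little or no ASO was present to bind it.
        return "little or no ASO in sample"
    # The added SLO was neutralized, so the sample contained enough ASO to bind it.
    return "ASO sufficient to neutralize the added SLO"


def titer_is_elevated(titer_iu_per_ml: float,
                      cutoff: float = ASO_TITER_CUTOFF_IU_PER_ML) -> bool:
    """True when the titer exceeds the cutoff taken as indicative of infection."""
    return titer_iu_per_ml > cutoff


print(interpret_hemolysis(True))
print(titer_is_elevated(450.0))
```

Note the inverse relationship: more hemolysis means less
antibody in the sample.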

In order to utilize the immunodiagnostic 
assaying technique outlined above, it is necessary to 
have access to sufficient SLO to be added to the mixture. 
One source of SLO is culture broths containing the 
bacteria Streptococcus pyogenes ("S. pyogenes"). 
However, obtaining SLO in this manner is quite difficult 
and costly: for every liter of the S. pyogenes culture 
broth, only about 0.5 mg of SLO can be expected; the
typical media for growing S. pyogenes is expensive; S. 
pyogenes is a class 2 pathogen; and SLO obtained in this 
manner contains many other antigenic materials. 
Additionally, SLO obtained by this procedure tends to be 
unstable in liquid form. Accordingly, such SLO 
preparations are most typically supplied as lyophilized 
powder in vials. Before use, the lyophilized powder must 
be reconstituted in a suitable solvent. Unfortunately, 
such reconstituted SLO will rapidly lose its hemolytic 
activity and therefore it must be used within a brief 
period after reconstitution or discarded. This has one 
notable and negative consequence: it is usually 
impossible to test individual serum samples as soon as 
they are obtained. Thus, laboratories which conduct ASO 
assays based upon hemolytic activity typically store the 
individual samples until a sufficient number are 
collected to enable economic use of the lyophilized SLO. 
This can result in an inordinate delay in obtaining test 
results. 

ASO assays which rely upon nephelometric or
turbidimetric protocols need significant amounts of
purified SLO. Because obtaining significant quantities
of purified SLO from S. pyogenes is expensive, the
foregoing hemolytic-based assay was the first ASO assay
to become commercially available.

Recombinant DNA techniques for obtaining SLO fusion
products offer the benefit of obtaining relatively large
quantities of such products. Using such technology, it
would be possible to avoid the tedious and
cost-ineffective aspects of obtaining SLO from S. pyogenes.
As used herein, the term "SLO derivative" is an SLO
fusion product which is soluble, hemolytically active and
which is capable of being bound by at least one antibody
to wild-type SLO. SLO derivatives are designated herein
as "rSLO". These SLO derivatives are provided in large
quantities, are substantially pure, and maintain
hemolytic activity.



Such SLO derivatives would be beneficial, e.g., in
immunodiagnostic assays which rely upon, e.g., the
hemolytic properties of wild-type SLO.

SUMMARY OF THE INVENTION 

The present invention provides SLO variants.
These variants, designated herein as "mSLO", comprise the
following characteristics and are broadly defined
thereby: (i) recognized by wild-type anti-streptolysin O
antibodies (ASO), i.e., comprising at least one epitope
characteristic of wild-type Streptolysin O; and (ii)
substantially non-hemolytic activity. As used herein,
the term "recognized" means capable of binding to at
least one epitopic site on mSLO; the phrase
"substantially non-hemolytic activity" means a percent
wild-type SLO specific activity of less than about 75%
based upon a wild-type SLO specific activity of 4x10^5
hemolytic units/mg wild-type SLO; and "wild-type SLO" is
accorded the usual definition associated with such
phrase, i.e., SLO that is naturally secreted by a capable
bacterial source. "Wild-type SLO", by definition, does
not include, e.g., SLO fusion products derived via
recombinant DNA techniques.

A particularly useful rSLO in accordance with
the present disclosure is designated herein as rSLO.3
having a specific hemolytic activity of about 3.6 x 10^4
hemolytic units ("HU") per mg.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a single strand of the nucleic acid 
sequence of a most preferred embodiment of an SLO 
derivative, designated rSLO.3; and 

Fig. 2 is the amino acid sequence of rSLO.3. 

DETAILED DESCRIPTION OF 
PREFERRED EMBODIMENTS 

As used in this disclosure, Streptolysin O
derivatives, or "rSLO", comprise the following
characteristics and are broadly defined thereby: (i)
capable of being bound by at least one antibody to
wild-type SLO; (ii) soluble upon expression; and (iii)
hemolytically active. As used herein, the phrase
"wild-type SLO" is accorded the usual definition associated
with such phrase, i.e., SLO that is naturally secreted by
a bacterial source capable of secreting such protein.
Preferably, the hemolytic activity of rSLO is about 75%
of the hemolytic activity of wild-type SLO, based upon a
wild-type SLO specific activity of 4x10^5 hemolytic
units/mg. More preferably, the hemolytic activity of
rSLO is between about 5% and 50%, and most preferably
about 9%, of the hemolytic activity of wild-type SLO,
based upon a wild-type SLO specific activity of 4x10^5
hemolytic units/mg wild-type SLO. These values are
relative; thus, if percent wild-type SLO specific
activity is based upon a wild-type SLO specific activity
of 1x10^6 hemolytic units/mg, the above values are
decreased by a factor of 2.5 (i.e., 75% becomes 30%; 9%
becomes 3.6%; etc.).
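
The percentage arithmetic can be reproduced with a short
Python sketch. The function names are hypothetical; the
numbers are those stated in this disclosure (a wild-type
basis of 4x10^5 or 1x10^6 hemolytic units/mg, and
3.6 x 10^4 hemolytic units/mg for rSLO.3).

```python
def percent_of_wild_type(specific_activity: float, wild_type_activity: float) -> float:
    """Specific activity expressed as a percentage of the wild-type basis (HU/mg)."""
    return 100.0 * specific_activity / wild_type_activity


def rebase_percent(percent: float, old_basis: float, new_basis: float) -> float:
    """Re-express a percentage when the wild-type basis changes."""
    return percent * old_basis / new_basis


# rSLO.3 at 3.6e4 HU/mg against a 4e5 HU/mg wild-type basis.
rslo3_percent = percent_of_wild_type(3.6e4, 4e5)

# The same percentages re-expressed against a 1e6 HU/mg basis (a factor of 2.5).
rebased_75 = rebase_percent(75.0, 4e5, 1e6)
rebased_9 = rebase_percent(9.0, 4e5, 1e6)
print(rslo3_percent, rebased_75, rebased_9)
```

The rSLO.3 figure corresponds to the "most preferably
about 9%" value when the 4x10^5 basis is used.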

The preceding is detailed because the
"specific activity" of wild-type SLO has been described
as being as high as about 1x10^6 hemolytic units/mg,
although specific activity of about 4x10^5 hemolytic
units/mg has also been described. Alouf, J. E.
"Streptococcal Toxins (Streptolysin O, Streptolysin S,
Erythrogenic Toxin)." Pharmac. Ther. 11:661-717 (1980),
which is incorporated herein by reference. Accordingly,
because the reported "specific activity" of wild-type SLO
is elusive, the foregoing percentages accommodate this
fact.

For convenience, as used herein, the term
"vector" means a circular DNA macromolecule comprising at
least one restriction site and at least one promoter
gene. The term "plasmid" means a vector further
comprising a portion of a genome of interest, including,
inter alia, a gene. The term "host" means a cell capable
of being transfected by a plasmid.

As those in the art appreciate, most vectors
are selected with respect to a desired outcome. For
example, in a commercial setting, high-level expression
of the gene of interest will typically be preferred such
that vectors with an appropriate promoter conducive to
such expression will be chosen; on the other hand, in a
research setting, such high-level expression may not be
critical such that a vector having translational and
transcriptional signals that are under the control of
regulatory elements of the host may be appropriate.
Accordingly, in selecting a vector appropriate for the
desired outcome, it is often useful to concurrently focus
on the promoter gene of the vector of interest.

Promoter genes which achieve very high levels
of mRNA production include, for example, pL, ptac, and pT7.
This list is not intended, nor is it to be construed, as
an exhaustive list. Rather, these promoters are used as
exemplars for purposes of the discussion to follow.
Those in the art can readily select an appropriate vector
having a desired promoter which can provide equivalent
results vis-a-vis the listed promoters.

For example, pT7 is used in conjunction with T7
RNA polymerase which synthesizes RNA at a rate several
times that of E. coli RNA polymerase and which terminates
transcription less frequently than E. coli RNA
polymerase. T7 RNA polymerase is highly selective for
initiation at its own promoter sequence; accordingly, it
does not initiate transcription from any sequences on E.
coli DNA. Furthermore, T7 RNA polymerase is resistant to
antibiotics such as rifampicin that inhibit E. coli RNA
polymerase. Therefore, the addition of rifampicin, for
example, to cells that are producing T7 RNA polymerase
results in the exclusive expression of genes under the
control of a T7 RNA polymerase promoter, i.e., pT7.

Expression using the T7 RNA polymerase/pT7
system relies upon (typically) a two-plasmid system: the
first plasmid comprises the gene to be expressed and pT7;
the second plasmid comprises the gene for T7 RNA
polymerase. The second plasmid, e.g., pGP1-2 (which
comprises the gene for T7 RNA polymerase; see Tabor and
Richardson, Proc. Natl. Acad. Sci. U.S.A. 82:1074-1078
(1985)), can either permanently reside in E. coli or
can be introduced into E. coli with a specialized phage,
such as, e.g., an M13 vector (such as, e.g., mGP1-2, see
Tabor and Richardson), or a λ vector (such as, e.g., CE6,
see Studier and Moffatt, J. Mol. Biol. 189:113-130 (1986))
comprising the T7 RNA polymerase gene.

Typically, the second plasmid comprising the
T7 RNA polymerase gene is under the control of a
heat-inducible E. coli promoter, i.e., by raising the
temperature from, e.g., 30°C to 42°C, the heat-inducible
E. coli promoter is switched on, which will in turn
switch on the pT7 promoter of the first plasmid, thereby
leading to the expression of, e.g., the gene of interest.
Thus, when using a T7 RNA polymerase/pT7 expression
system, the E. coli system comprises a heat-inducible
promoter, such as, for example, lambda pL with a cI857
repressor.



Examples of vectors comprising pT7 include,
e.g., the pT7 series (pT7-5, pT7-6, and pT7-7, which are
derivatives of pT7-1; see Tabor and Richardson, supra)
and the pET series (see Studier et al., Methods Enzymol.
185:60-89 (1990)).



Another vector system comprises a pL promoter
gene. The pL promoter is derived from the λ bacteriophage
and is one of the most powerful regulated E. coli
promoters. Transcription from pL can be fully repressed
and therefore plasmids comprising pL can be stabilized by
the λ repressor, cI. This repressor is typically
supplied by an E. coli host which comprises an integrated
copy of a portion of the λ genome. Such an E. coli host,
referred to as an "E. coli lysogen", is characterized as
follows: (i) it supplies the λ regulatory proteins cI and
N (an anti-termination function); and (ii) it does not
provide lytic components that would normally lead to cell
lysis. Accordingly, E. coli lysogens transfected with
plasmids comprising, e.g., a gene of interest and pL, can
be grown initially to high density without expression of
the gene and subsequently induced to synthesize the
protein upon inactivation of the repressor. Examples of
pL based vectors are described in, e.g., U.S. Patent No.
4,925,799 ("pAS1"), Shatzman and Rosenberg, "The pAS
Vector System and Its Application to Heterologous Gene
Expression in Escherichia coli." Hepatology 7:30S-35S
(1987), and Rosenberg et al., "The Use of pKC30 and
its Derivatives for Controlled Expression of Genes."
Methods Enzymol. 101:123-139 (1983).

The ptac promoter is a hybrid promoter based on
the trp and lac promoters. de Boer, et al. "The tac
promoter: A functional hybrid derived from the trp and
lac promoters." Proc. Natl. Acad. Sci. USA 80:21-25 (1983);
see also, Amann, et al. "Vectors bearing a hybrid trp-lac
promoter useful for regulated expression of cloned genes
in Escherichia coli." Gene 25:167-178 (1983). Because ptac
includes the lac operator region, it can be repressed by
E. coli strains that overproduce the lac repressor, and
be fully induced by addition of isopropyl
β-D-thiogalactoside (IPTG) thereto.

All of the foregoing references are
incorporated herein by reference.

The choice of an appropriate vector/host
system is within the realm of the particular needs of the
artisan. A most preferred vector is based upon the pL
promoter. Table I sets forth a representative (not
exclusive) list of suitable vectors and hosts, as well as
the sources thereof.



TABLE I

Vector        Host*                         Source
pBTac1 DNA    JM101, JM105, JM107, JM109    (1)
pBTac2 DNA    JM101, JM105, JM107, JM109    (1)
pNH8A         D1210PH, D1210                (2)
pNH16A        D1210PH, D1210                (2)
pNH18A        D1210PH, D1210                (2)
pPROK-1       JM109                         (3)
pEX2          N4830-1                       (3)
pUC19         JM101, JM105, JM107, JM109    (4)
p33           AR120, AR58                   (5)
pΔ33          AR120, AR58                   (5)
pPL-Lambda    N99cI+, N4830-1               (6)

* = E. coli cell
(1) = Boehringer Mannheim
(2) = Stratagene Cloning Systems
(3) = Clontech Laboratories, Inc.
(4) = Bethesda Research Labs
(5) = SmithKline Beckman, now SmithKline Beecham
(6) = Pharmacia LKB

For the following examples, the vectors pΔ33 and
pBTac2 DNA were utilized in conjunction with the host
strains AR120 and JM105, respectively, for the subcloning
(initially from pUC19 vector) and expression of rSLO.3.

EXAMPLES 



The following Examples directed to preferred
embodiments are not intended, nor are they to be
construed to be, limitations on the disclosure or the
claims to follow.

Example 1

Preparation of Partially Digested 
Genomic Streptolysin O DNA 

Genomic DNA was isolated from Streptococcus pyogenes
(ATCC #10389) using the technique described in Kehoe, M.
et al., Infect. Immun. 55:3228-3232 (1987) (hereinafter
"Kehoe, 1987"), which is incorporated herein by
reference. Approximately 1 mg of S. pyogenes DNA was
obtained using this procedure (925 µg).

To 370 µl of S. pyogenes DNA (2.5 µg/µl) was added
300 µl of 10X High Salt Buffer (1.0M NaCl; 100mM
tris(hydroxymethyl)aminomethane chloride ("TRIS-Cl"),
pH 7.5; 100mM MgCl2; and 10mM dithiothreitol ("DTT")),
2310 µl of deionized H2O and 20 µl of Bgl II (BRL,
Gaithersburg, MD, Cat. # 5213SA), for a final volume of
3000 µl. This mixture was maintained at 37°C and
incubated overnight.
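
The quantities in this digest can be cross-checked with a
few lines of Python. The sketch below is only a
bookkeeping aid with hypothetical variable names; it uses
the volumes and the DNA concentration stated above.

```python
# Volumes in microliters, as stated for the Bgl II partial digest.
components_ul = {
    "S. pyogenes DNA": 370,
    "10X High Salt Buffer": 300,
    "deionized H2O": 2310,
    "Bgl II": 20,
}
DNA_CONCENTRATION_UG_PER_UL = 2.5

# Total reaction volume.
total_volume_ul = sum(components_ul.values())

# Mass of genomic DNA carried into the reaction.
dna_mass_ug = components_ul["S. pyogenes DNA"] * DNA_CONCENTRATION_UG_PER_UL

# Final strength of the buffer after dilution from the 10X stock.
buffer_strength_x = 10 * components_ul["10X High Salt Buffer"] / total_volume_ul

print(total_volume_ul, dna_mass_ug, buffer_strength_x)
```

The components sum to the stated final volume, the DNA
mass matches the 925 µg yield reported for the isolation,
and the 10X buffer ends up at 1X.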

To this incubated mixture was added 3000 µl of
Reagent A (250 µl phenol, 250 µl chloroform, 10 µl isoamyl
alcohol, 1 µl β-mercaptoethanol). This mixture was
agitated prior to centrifugation in order to separate the
aqueous and the organic layers. The aqueous supernatant
was then precipitated with 0.3M NaOAc and 95% ethanol.
The precipitate was then redissolved in 250 µl TE (10mM
TRIS-Cl, pH 7.5; 1mM EDTA) and 25 µl of 10X loading dye
(0.2M EDTA; 50% glycerol; 0.25% xylene cyanol; 0.25%
bromophenol blue) was added thereto, followed by
electrophoresis on 1% agarose gel. The Bgl II partially
digested S. pyogenes genomic DNA fragments were then
evaluated according to size.



As noted, SLO has an approximate molecular weight of
65,000 to 70,000 daltons. Each amino acid has an
approximate molecular weight of 110 daltons, such that
(conservatively estimating) a 70,000 dalton protein would
be encoded by approximately 636 codons, or 1909 base
pairs. Accordingly, the partially digested fragments of
between about 2,000 to 2,500 base pairs (i.e., 2.0 to 2.5
Kb), as determined by the aforementioned gel
electrophoresis method, were purified. The purified
fragments were then resuspended in 150 µl of TE. For
convenience, these are designated herein as "SLO
inserts".
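
The size estimate used to choose the fragment range can be
written out as a short Python calculation. The function
name is hypothetical; the 110 dalton average residue mass
and the 65,000 to 70,000 dalton range are the figures
given above.

```python
AVERAGE_RESIDUE_MASS_DA = 110  # approximate mass per amino acid, as stated in the text


def estimate_coding_length(protein_mass_da: float) -> tuple:
    """Return (codons, base pairs) needed to encode a protein of the given mass."""
    codons = protein_mass_da / AVERAGE_RESIDUE_MASS_DA
    base_pairs = codons * 3  # three nucleotides per codon
    return round(codons), round(base_pairs)


upper_estimate = estimate_coding_length(70_000)
lower_estimate = estimate_coding_length(65_000)
print(upper_estimate, lower_estimate)
```

Both estimates fall just under 2,000 base pairs, which is
consistent with selecting fragments of about 2.0 to 2.5 Kb
so that the complete coding region can fit within one
insert.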



Example 2 

Preparation of Streptolysin O 
Containing Plasmids 

The vector utilized was pUC19 (BRL, Cat. # 5364SA)
cut with Bam HI (BRL, Cat. # 5201SA).

To 1 µl of cut pUC19 vector was added 15 µl of the SLO
inserts, 3 µl of 10X ligation buffer (660mM TRIS-Cl, pH
7.5; 50mM magnesium chloride; 10mM DTT; 10mM ATP). A
final volume of 30 µl was achieved by the addition of 8 µl
of deionized H2O. To this mixture was added 2 µl of T4
ligase (USB, 5 µg/µl); incubation thereof at room
temperature proceeded overnight. For convenience, the
resulting material is designated as "SLO plasmid
candidates".

Example 3

Screening of SLO Plasmid Candidates

Host cells E. coli strain JM105 were transformed
with the SLO plasmid candidates as follows. A vial
containing 300 µl of frozen JM105 competent cells was
thawed, and 16.0 µl of the SLO plasmid candidates was
added thereto. This admixture was incubated on ice for
30min, followed by heat shock in a 37°C water bath for
2min. Thereafter, the transfected JM105 solution was
added to 2ml of LB medium (10g Bacto-tryptone; 5g Bacto
yeast extract; 10g NaCl; 1 liter deionized water; pH 7.5
with sodium hydroxide), followed by shaking (200RPM) for
30min at 37°C. Plating was thereafter accomplished on LB
Ampicillin plates, followed by incubation overnight at
37°C; for convenience, these are designated "SLO
transformants".



Screening was accomplished utilizing a unique
procedure. Following overnight growth, the colonies were
overlaid with 3ml of 2.5% washed rabbit red blood cells
in 0.8% agarose in PBS/10mM DTT, which was spread to
cover the plates. After 40min of incubation at 37°C,
colonies comprising SLO were surrounded by small zones of
hemolysis. In order to confirm that these colonies
comprised SLO, a 25-mer oligonucleotide probe derived
from nucleotides 670 through 694, inclusive, of the
reported DNA sequence of SLO (see Kehoe, 1987) was used
as a probe. The probe was prepared with a BioSearch 8600
DNA synthesizer, and labelled with 32P following the T4
polynucleotide kinase procedure described in Maniatis et
al., Molecular Cloning, CSHL (1982), pp. 122-126
(hereinafter "Molecular Cloning").

25 

The blood overlay screening technique proved to be an efficient and
accurate method for rapidly screening the SLO expressed by the SLO
transformants. Because a property of SLO is the ability thereof to
lyse red blood cells, red blood cells from any source can be utilized,
e.g., human, mouse, goat, rabbit, etc. Rabbit red blood cells are
preferred due to the availability thereof.

An SLO clone that led to the expression of protein which evidenced
hemolytic activity and which hybridized with the 25-mer probe was
designated "pUC19-SLO-B". For convenience, the non-vector DNA sequence
thereof is designated herein as "rSLO-candidates".

Example 4

Optimization of Expression and
Determination of Solubility

In order to optimize the expression of rSLO-candidates,
timed-digestion of rSLO-candidates using Bal-31 was accomplished.
Additionally, and as previously noted, solubility of the expressed
protein ab initio, i.e., without further chemical modification once
expressed, is of import. This is because non-soluble SLO is by
definition inactive. Accordingly, an analysis was also made to
determine if the expressed protein was soluble, i.e. was located in a
supernatant as opposed to a pellet, following centrifugation.

The pUC19-SLO-B was initially cut with BstE II (New England Bio Labs,
Cat. # 162, 10 U/µl) as follows. To 20µl of pUC19-SLO-B (2.5µg/µl) was
added 40µl of 10X High Salt Buffer, 335µl deionized H2O, and 5µl of
BstE II. This admixture was incubated at 60°C for 2hrs, followed by
extraction with 400µl of Reagent A, and precipitation with 44µl of 3M
NaOAc (pH 4.8) in 888µl of 95% ethanol. The precipitant was then
redissolved in 40µl H2O. Thereafter, 90µl of H2O, 20µl of 10X Bal-31
Buffer (120mM CaCl2; 120mM MgCl2; 2.0M NaCl; 0.2M TRIS-Cl, pH 8.0;
10mM EDTA), and 50µl of 1mg/ml Bovine Serum Albumin were admixed with
the redissolved precipitant. This was followed by the addition of 10µl
of Bal-31 (New England Bio Labs, Cat. # 213, 100 U/ml), for a total of
210µl, followed by incubation at room temperature. To control the
effects of Bal-31, 30µl aliquots of the 210µl total solution were
removed at 30, 45, 60, 80, 105, 130 and 160min post-Bal-31 addition,
and these aliquots were each admixed in 3.3µl of 0.2M EGTA, followed
by storage on ice. After preparation and storage of the last aliquot,
all seven aliquots were pooled, extracted with 230µl of Reagent A, and
precipitated with 23µl of 3M NaOAc in 506µl of 95% ethanol. The
precipitate was then redissolved in 75µl of H2O.
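
The volume bookkeeping of the Bal-31 time course described above can be checked with a short script. The following is only a minimal sketch in Python: the variable and function names are illustrative assumptions and are not part of the disclosed protocol; only the volumes and time points are taken from the text.

```python
# Component volumes (in microliters) of the Bal-31 reaction, as stated in the text.
# The dictionary keys are descriptive labels chosen for this sketch.
bal31_reaction_ul = {
    "redissolved BstE II-cut DNA": 40,
    "deionized H2O": 90,
    "10X Bal-31 Buffer": 20,
    "1mg/ml Bovine Serum Albumin": 50,
    "Bal-31": 10,
}

# Time points (minutes after Bal-31 addition) at which aliquots were removed.
aliquot_times_min = [30, 45, 60, 80, 105, 130, 160]
ALIQUOT_UL = 30   # volume of each aliquot
EGTA_UL = 3.3     # volume of 0.2M EGTA admixed with each aliquot


def total_volume(components):
    """Return the summed volume of all reaction components."""
    return sum(components.values())


def pooled_volume(n_aliquots, aliquot_ul=ALIQUOT_UL, egta_ul=EGTA_UL):
    """Return the volume obtained when all EGTA-quenched aliquots are pooled."""
    return n_aliquots * (aliquot_ul + egta_ul)


if __name__ == "__main__":
    print(f"Reaction volume: {total_volume(bal31_reaction_ul)} ul")
    print(f"Aliquots removed: {len(aliquot_times_min)} x {ALIQUOT_UL} ul")
    print(f"Pooled volume with EGTA: {pooled_volume(len(aliquot_times_min)):.1f} ul")
```

Under these figures the seven 30µl aliquots account for the whole 210µl reaction, and the pooled, EGTA-quenched volume (about 233µl) is close to the 230µl of Reagent A used for the extraction.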

A fill-in reaction followed, by the addition of 5µl of 2.5mM dXTP,
10µl of 10X Medium Salt Buffer (500mM NaCl; 100mM TRIS-Cl, pH 7.5;
100mM MgCl2; 10mM DTT), and 10µl of 100mM DTT to 75µl of the
redissolved precipitant, followed by the addition thereto of 6µl of
Klenow polymerase (5 U/µl), and incubation at room temperature for
4hrs. This admixture was extracted with 100µl of Reagent A and
precipitated with 11µl of 3M NaOAc in 22µl of 100% ethanol, and the
precipitate was resuspended in 40µl of H2O.

Following the fill-in reaction, 17µl of the resuspended precipitate
was admixed with 3µl of a linker comprising a Bam HI sequence (New
England Bio Labs, Cat # 1021) and 5µl of 5X linker ligation buffer
(250mM TRIS-Cl, pH 7.6; 50mM MgCl2; 5mM DTT; 5mM ATP; 2.5% (w/v) PEG
8000 (J.T.Baker, Cat. # U222-09)). This was followed by the addition
thereto of 2µl of T4 ligase (5 U/µl), and incubation thereof for 6hrs
at room temperature. For convenience, the resulting material is
referred to as "ca/ew".

E. coli strain JM105 was transformed with ca/ew as described above,
followed by overnight growth as described above in Example 3. To
determine if the plasmids comprised the Bam HI linker, to 40µl of
ca/ew (0.5µg/µl) was added 40µl of 10X Medium Salt Buffer and 320µl of
deionized H2O. To this mixture was added 5µl of EcoRI (BRL, Cat # 5202
SA, 10U/µl), followed by incubation at 37°C for 2hrs. In order to
ensure that the plasmid was cut, gel electrophoresis (1% agarose gel)
was conducted; this resulted in a smear of different sizes, indicating
a successful cut. To the cut plasmid was added 8µl of 5M NaCl,
followed by 5µl of Bam HI (10 U/µl). This mixture was incubated at
37°C for 2hrs. Determination of the size of the rSLO-candidate
sequence subjected to Bal-31 digestion was conducted by gel
electrophoresis (1% agarose gel). This resulted in a band of interest
at about 1.2 to about 2.0 Kb which comprised rSLO-candidates. Thus,
the initial fragments of 2.0 to 2.5 Kb which evidenced hemolytic
activity had been significantly decreased in size.

The band which comprised rSLO-candidate was cut from the gel and
purified in 15µl of TE such that rSLO-candidate was available for
ligation in pUC19 vector previously cut with Bam HI and EcoRI. In
order to accomplish such ligation, 10µl of the gel-purified
rSLO-candidate was admixed with 4µl of the previously prepared vector,
2µl of 10X ligation buffer, 2µl of 10mM ATP, and 2µl of deionized H2O.
To this admixture was then added 2µl of T4 ligase, followed by
incubation at room temperature for 6hrs. E. coli host cell strain
JM105 was transformed with these plasmids as above, and active
colonies were screened by the red-blood cell overlay method disclosed
above. Active colonies were then selected, inoculated in LB
Medium/100µg/ml Ampicillin and grown overnight under the conditions
described above.

Following overnight growth, the cells were centrifuged for 5min at
8000RPM at 4°C, and the resulting pellet resuspended in 2ml of Reagent
B (150mM NaCl; 20mM TRIS, pH 7.0; 1mM EDTA). Thereafter, the
resuspended cells were subjected to sonication for 2X 30sec. on ice,
followed by centrifugation at 9500RPM for 40min at 4°C using a Beckman
JA20.1 centrifuge to obtain the expressed protein.

At this stage, if the rSLO-candidate led to the expression of a
soluble protein, that protein would be located in the supernatant.
Accordingly, analysis was conducted for the presence of rSLO-candidate
in the supernatant using standard Western blot protocols for
determination of an antigenically active protein. The results of such
Western blot analysis indicated that there was an SLO fusion product
in the supernatant which was recognized by horse anti-SLO antibodies.
One such fusion product was selected and designated "rSLO.3". For
convenience, the DNA sequence leading to the expression of rSLO.3 is
also referred to as rSLO.3. High level expression of rSLO.3 was
thereafter attempted.

Example 5

High Level Expression of rSLO.3

Removal of rSLO.3 from the plasmid comprising pUC19 vector was
effectuated as follows. To 15µl of the plasmid comprising rSLO.3
(0.5µg/µl) was added 40µl of 10X Sma I Buffer (200mM KCl; 100mM
TRIS-Cl, pH 8.0; 100mM MgCl2; 10mM DTT), and 345µl of deionized H2O.
To this mixture was added 5µl Sma I (BRL, Cat # 5228 SA, 10U/µl),
followed by incubation at 37°C for 2hrs. In order to ensure that the
plasmid was cut, gel electrophoresis (1% agarose gel) was conducted;
this resulted in a single band, indicating a successful cut. To the
cut plasmid was added 8µl of 5M NaCl, followed by 5µl of Bam HI
(10U/µl). This mixture was incubated at 37°C for 2hrs. To ensure that
the rSLO.3 sequence was successfully cut from the approximately 2.7Kb
pUC19 vector, gel electrophoresis (1% agarose gel) was conducted. This
resulted in two bands, one at about 2.7Kb (the vector), and the other
at about 1.4Kb (rSLO.3). The 1.4Kb band was cut from the gel and
purified in 15µl of deionized H2O such that rSLO.3 was available for
ligation in pA33 vector previously cut with Bam HI and Sma I.
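
The gel results used above as a check on each digestion step (a single band after the first enzyme, two bands after the second) follow from simple size arithmetic. The sketch below is illustrative only: it assumes that each enzyme cuts the plasmid exactly once and that the two sites flank the insert, and the function name is hypothetical. The combined size of about 4.1Kb for the linearized plasmid is inferred here from the stated 2.7Kb and 1.4Kb bands; it is not given in the text.

```python
def expected_bands_kb(vector_kb, insert_kb, double_digest):
    """Return expected fragment sizes (Kb), largest first.

    Assumes a circular plasmid made of one vector and one insert, cut once by
    the first enzyme (single digest) or once by each of two enzymes whose sites
    flank the insert (double digest).
    """
    if double_digest:
        # Two cuts release the insert from the vector backbone.
        return sorted([vector_kb, insert_kb], reverse=True)
    # One cut merely linearizes the plasmid: a single band of the combined size.
    return [round(vector_kb + insert_kb, 1)]


if __name__ == "__main__":
    # pUC19 (about 2.7Kb) carrying rSLO.3 (about 1.4Kb)
    print(expected_bands_kb(2.7, 1.4, double_digest=False))
    print(expected_bands_kb(2.7, 1.4, double_digest=True))
```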

To 2µl of above-derived rSLO.3 DNA was added 2µl of the above
described vector, 1.5µl of 10X ligation buffer, 1.5µl of 10mM ATP, and
8µl of deionized H2O. To this admixture was added 2µl of T4 ligase (10
U/µl), followed by incubation for 5hrs at room temperature. Such
incubation resulted in plasmids comprising rSLO.3 and pA33 vector.

E. coli strain AR120 was transformed with the above-described plasmids
in accordance with the procedure outlined for E. coli strain JM105.
Thereafter, a DNA mini prep was performed as described in Current
Protocols in Molecular Biology, Ausubel, F.M. et al., Eds, John Wiley
& Sons (New York) (1987), Section 1.6., followed by cutting the
plasmids with Bam HI and Sal I (BRL, Cat. # 5217 SA) to determine if
the plasmids comprised rSLO.3. Those host cells transformed with
plasmids comprising rSLO.3 were then subjected to induction via the
nalidixic acid protocol. See Mott, J.E. et al., "Maximizing gene
expression from plasmid vectors containing the λpL promoter:
Strategies for overproducing transcription termination factor ρ."
PNAS USA, 82:88-92 (1985), which is incorporated herein by reference.
As those in the art appreciate, nalidixic acid, which damages DNA,
induces recA protein, a recovery protein for E. coli. A



SUBSTITUTE SHEET 



WO 93/05156 



PCT/US92/06398 




derivative benefit vis-a-vis overexpression is that recA has protease
activity, which, inter alia, leads to inactivation of λcI+ repressor;
this inactivation leads to overexpression by the pL promoter.

Specifically, colonies comprising the transformed AR120 were lifted
from the agar plates and inoculated in Superbroth (Base - 12g
tryptone, 24g yeast extract, 5ml glycerol, 900ml distilled H2O; Salt
(per liter of base) - 1.7g KH2PO4, 15.8g K2HPO4 (anhydrous), 100ml
distilled H2O) plus 100µg/ml ampicillin at 37°C until the optical
density of the medium equalled 0.4. Thereafter, nalidixic acid was
added to the inoculated mixture at a final concentration of 60µg/ml
and incubated at 37°C for 4hrs. Western blot analysis of the
supernatant demonstrated the presence of rSLO.3.

The DNA and amino acid sequences of rSLO.3 were thereafter determined
(Lark Sequencing Technologies, Houston TX). A single stranded
representation of the determined DNA sequence of rSLO.3 is presented
in Figure 1 and the determined amino acid sequence of rSLO.3 is
presented in Figure 2.

Example 6

Specific Activity of rSLO.3

Protein concentration and specific activity of non-purified rSLO.3
were determined immediately following nalidixic acid induction.

Protein concentration for rSLO.3 crude extracts was derived using the
BioRad Protein Assay method (Coomassie Blue G-250). Nalidixic acid
induced protein mixtures were centrifuged at 8000RPM for 5min at 4°C
and the pellets resuspended in 500µl sonication buffer (40mM TRIS, pH
7.5; 1mM EDTA; 1mM DTT; 200mM NaCl). The resuspended pellets were
then sonicated for 2X 30sec on ice, followed by centrifugation at
12,000RPM for 40min at 4°C. Thereafter, 5µl of the resuspended rSLO.3
mixture was analyzed for protein concentration (OD reading at A595),
and the protein concentration was determined to be 4.6µg/µl.

Specific activity was determined by serial dilutions of the above
described crude extract and addition thereto of washed rabbit red
blood cells ("RRBC"), followed by spectrophotometric reading (OD
reading at A541). 5ml of fresh rabbit blood was washed 2X with 45ml of
PBS including 10mM DTT, followed by centrifugation at 2000RPM for 5min
at 4°C. Thereafter, 1.125ml of the washed RRBC were drawn from the
bottom of the tube and 48.875ml of PBS/10mM DTT was added thereto.
This resulted in a solution comprising 2.25% RRBC. For the hemolytic
assays, 500µl of the 2.25% RRBC was added to 500µl of 1:2 serially
diluted rSLO.3 in PBS/10mM DTT, followed by incubation at 37°C for
30min.

These serial dilutions were spectrophotometrically analyzed (OD
readings at A541). This analysis indicated that 0.2µl of the diluted
rSLO.3 crude extract caused 50% hemolysis of the RRBC; 0.2µl of the
diluted extract is equivalent to 2µl of the extract itself.
Accordingly, the rSLO.3 crude extract evidenced one hemolytic unit
("HU") per two microliters, or 500 HU/ml.
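
The arithmetic behind the red blood cell suspension and the hemolytic unit figure above can be reproduced as follows. This is a minimal sketch, assuming (as the text implies) that one hemolytic unit is the amount of extract causing 50% hemolysis in the assay; the function names are illustrative and do not come from the source.

```python
def percent_rrbc(packed_cells_ml, buffer_ml):
    """Volume percent of washed red blood cells in the final suspension."""
    return 100.0 * packed_cells_ml / (packed_cells_ml + buffer_ml)


def hemolytic_units_per_ml(ul_extract_for_50pct_lysis):
    """HU/ml, taking one HU as the extract volume (in ul) giving 50% hemolysis."""
    return 1000.0 / ul_extract_for_50pct_lysis


def two_fold_series(n_steps):
    """Dilution factors of a 1:2 serial dilution: 2, 4, 8, ..."""
    return [2 ** i for i in range(1, n_steps + 1)]


if __name__ == "__main__":
    print(percent_rrbc(1.125, 48.875))    # RRBC suspension, in percent
    print(hemolytic_units_per_ml(2.0))    # 2 ul of undiluted extract per HU
    print(two_fold_series(5))
```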

As noted, the protein concentration of the crude extract was
determined to be 4.6mg/ml. Accordingly, the specific activity of
rSLO.3 derived from the pA33-AR120 expression system was 108.7HU/mg.
It is noted that because these are values for a crude (i.e.
non-purified) extract, these values are predicated upon total protein
concentration of the extract. For a purified extract, the specific
activity values increase.
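
The specific activity reported above is the hemolytic activity per unit volume divided by the protein concentration. A minimal sketch of that calculation, with an illustrative function name, is:

```python
def specific_activity(hu_per_ml, protein_mg_per_ml):
    """Specific activity in hemolytic units per milligram of total protein."""
    return hu_per_ml / protein_mg_per_ml


if __name__ == "__main__":
    # Crude rSLO.3 extract: 500 HU/ml at 4.6 mg/ml total protein
    print(round(specific_activity(500.0, 4.6), 1))  # 108.7
```

Because the denominator is total protein, removing contaminating protein lowers the denominator for the same activity, which is why the value rises on purification.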

Example 7

Recovery of rSLO.3

The following procedure is for approximately 200 grams of transformed
host cells (i.e., approximately 6 grams total protein).

Transformed host cells were resuspended in 200mls of Reagent C (40mM
TRIS, pH 7.5; 1mM EDTA; 0.1% 2-mercaptoethanol), followed by the
addition of 100mM PMSF. Thereafter, the cells were disrupted by
sonication, followed by the addition of 4ml of 100mM PMSF. This
admixture was centrifuged for 30min at 4°C at 15,000RPM.

The resulting supernatant was removed and saved; 200ml of Reagent C
was added to the pellet, followed by the addition of 4mls of 100mM
PMSF. The resuspended pellet was then sonicated, followed by
centrifugation as above. The resulting supernatant was then removed
and pooled with the previous supernatant, and the pH thereof was
adjusted to 7.0 with NaOH.



To the final volume of supernatant was slowly added (with stirring at
room temperature) Polymin P (Aldrich Chemicals) to a final
concentration of 0.75%. This admixture was then centrifuged for 30min
at room temperature at 10,000 RPM, followed by retrieval of the
supernatant. Solid sodium sulfate was slowly added with stirring to
80% saturation of the supernatant. Thereafter, the admixture was
stirred for 2hrs at 4°C, followed by centrifugation for 30min at 4°C
at 15,000 RPM. The pellet was then retrieved and resuspended in 400mls
of saturated ammonium sulfate, pH 7.0. The admixture was then
centrifuged for 30min at 4°C at 10,000 RPM, followed by retrieval of
the pellet and resuspension thereof in 200mls of Reagent D (20mM TRIS,
pH 7; 1mM EDTA; 0.1% 2-mercaptoethanol).

The resuspended pellet was then dialyzed against 2 liters of Reagent
D, with 4 changes, at 4°C. Sufficient room was left in the dialysis
bag in that the volume of the sample increases. Following dialysis,
the pH of the sample was checked, and adjusted to 7.0 with NaOH.

The sample was then loaded onto a Pharmacia Fast 
Flow S-Sepharose column equilibrated in Reagent D. A 
400ml bed volume was found to be sufficient to remove the 
mSLO.3/6 from the sample. The flow through, comprising 
E. coli proteins, was collected and discarded, and the 
column was washed with approximately 1 liter of Reagent 
D. 

The rSLO.3 was eluted with a 2x1 liter 0.0 to 0.4M NaCl gradient in
Buffer B. The fractions were analyzed by SDS acrylamide gel (9%), and
fractions with high amounts of rSLO.3 were pooled. Approximately 250ml
of pooled rSLO.3 was recovered.

Using the above procedure, approximately 60% of the original total
protein (i.e. approximately 0.36 grams) was rSLO.3, which can be
stored at 4°C until needed.

Example 8

Purification of rSLO.3

Purification of rSLO.3 was accomplished to a purity of at least 80%
using the following protocol.

Approximately 600g of frozen cell paste derived in accordance with the
protocol described in Example 9 was thawed (37°C), resuspended in 3
liters of cold lysis buffer (40mM TRIS-Cl, pH 7.0; 1mM EDTA; 0.1%
2-mercaptoethanol; 2M NaCl; 4°C) and sonicated for 60min at 4°-10°C
with a Heat Systems Ultrasonics Continuous Flow sonicator
(Farmingdale, N.Y., No. W-385). Thereafter, the material was
centrifuged on a Beckman JA10 centrifuge at 9500RPM for 40min at 20°
to 26°C. Approximately 3 liters of supernatant was retrieved.

To the supernatant was added a 12.5% stock solution of Polymin P
precipitant (Aldrich, Milwaukee, Wis.) to a final concentration of
between 0.2 and 0.3%. The solution was then stirred for 1hr at room
temperature and the precipitate discarded. The pH of the liquid
portion was then adjusted to 7.0 with NaOH. This liquid was then
permitted to stand overnight at room temperature.

Thereafter, the solution was centrifuged as above, and a clear
supernatant retrieved. The supernatant was then loaded onto a 1 liter
phenyl-sepharose HIC column (Pharmacia, Piscataway, N.J.) at 2ml/min.
at room temperature. Thereafter, the column was washed with an elution
buffer (20mM TRIS-Cl, pH 7.0; 1mM EDTA; 0.1% BME) at 7ml/min.
Fractions were monitored by SDS-PAGE electrophoresis using the
Pharmacia Phast-PAGE™ System. Protein concentration was determined
with the BioRad Protein Assay Kit. Fractions containing protein were
then pooled.



The pooled fractions were then loaded onto a 1 liter Blue Affinity
Column (BioRad, Richmond, California) at 2ml/min at room temperature,
followed by washing using the elution buffer described above at
2ml/min. at room temperature for two column volumes.

Elution of bound protein was accomplished using an NaCl density
gradient of 0.0 to 0.8M, pH 7.0. Fractions were monitored with the
Phast-PAGE System and protein concentration determined with the BioRad
Protein Assay Kit. A single peak was obtained at 0.3-0.4M on the NaCl
density gradient.



Purity of the eluted rSLO.3 was evaluated using a Beckman DU 7500
spectrophotometer, based upon analysis of major band homogeneity
obtained from gel electrophoresis (12% SDS-polyacrylamide) of six
different amounts of the eluted rSLO.3 (16, 8, 4, 2, 1, 0.5µg rSLO.3).
The evaluated purity of rSLO.3 based upon major band homogeneity is
set forth in Table 2:

                        Table 2

    rSLO.3 (µg)        Percent Homogeneity rSLO.3

        0.5                     99.0%
        1.0                     99.0%
        2.0                     94.4%
        4.0                     82.4%
        8.0                     81.4%
       16.0                     80.1%

For determination of hemolytic activity, the concentration of purified
rSLO.3 was determined. A 1:25,600 titer of a 0.7mg/ml concentration of
purified rSLO.3 was required to obtain greater than 50% lysis of 2.5%
RRBC. Accordingly, the specific hemolytic activity of purified rSLO.3
is about 3.6 x 10^4 HU/mg (25,600 ÷ 0.7).
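
The titer-based figure above can be reproduced with the short sketch below. It assumes, as the parenthetical calculation in the text implies, that the reciprocal of the titer is read directly as hemolytic units per milliliter; the function name is illustrative only.

```python
def specific_activity_from_titer(reciprocal_titer, protein_mg_per_ml):
    """HU/mg, reading the reciprocal titer (e.g. 25600 for 1:25,600) as HU/ml."""
    return reciprocal_titer / protein_mg_per_ml


if __name__ == "__main__":
    purified = specific_activity_from_titer(25_600, 0.7)
    # Roughly 3.6 x 10^4 HU/mg, as stated in the text
    print(f"{purified:.0f} HU/mg")
```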



As noted, the specific activity and percent hemolytic activity of
specific versions of rSLO (purified rSLO.3), based upon the "specific
activity" of wild-type SLO, is as follows:

                            TABLE 3

                                Wild-Type SLO      rSLO.3

    Specific Activity           a) 1x10^6          3.6x10^4
    (Hemolytic Activity         b) 4x10^5
    in Hemolytic Units/mg)

    Percent Hemolytic           a) 100             3.6
    Activity of Wild-Type       b) 100             9
    SLO
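
The percentages in Table 3 are the ratio of the rSLO.3 specific activity to each wild-type reference value. The sketch below recomputes them; the names are illustrative, and the second wild-type value is taken here as 4x10^5 HU/mg on the assumption that it is the figure consistent with the reported 9%.

```python
# Wild-type SLO reference specific activities (HU/mg), cases a) and b).
# Case b) is an assumption: 4e5 is the value consistent with the reported 9%.
WILD_TYPE_HU_PER_MG = {"a": 1e6, "b": 4e5}
RSLO3_HU_PER_MG = 3.6e4


def percent_of_wild_type(sample_hu_per_mg, reference_hu_per_mg):
    """Hemolytic activity of a sample as a percentage of a reference value."""
    return 100.0 * sample_hu_per_mg / reference_hu_per_mg


if __name__ == "__main__":
    for case, reference in WILD_TYPE_HU_PER_MG.items():
        pct = percent_of_wild_type(RSLO3_HU_PER_MG, reference)
        print(f"{case}) {pct:.1f}% of wild-type")
```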



Example 9

In Vivo Toxicity Effects of rSLO.3

In order to evaluate in vivo toxicity effects of rSLO.3, Balb/c mice
were administered undiluted and diluted intravenous injections of
rSLO.3. Undiluted and diluted control suspension buffer was
administered to an equivalent number of mice. To improve the
intravenous injections, the mice were warmed under a heat lamp for
20-30 minutes pre-injection. Approximately 20 mice were used for each
condition.

For the undiluted rSLO.3, each mouse received an approximate dosage of
17 mg/kg, while for the diluted rSLO.3, each mouse received an
approximate dosage of 1000 µg/kg. Control solution buffer did not
affect the control mice.

Aside from minor ruffling for several minutes after injection, none of
the mice receiving either diluted or undiluted rSLO.3 showed any ill
effects from the intravenous administrations. Thus, while rSLO.3 is
hemolytically active, mice receiving injections of rSLO.3 as described
above did not expire.

Example 9

Subcloning of rSLO.3

Having obtained, verified and sequenced rSLO.3, subcloning and
expression thereof using another expression/vector system was
initiated. The vector, pBTac2 DNA (Boehringer Mannheim, Cat. No.
1081381, 10µg) was cut with Hind III (BRL, Cat. No. 5207SA, 10 U/µl)
by admixing 30µl of pBTac2 DNA (1µg/µl), 30µl of 10X Medium Salt
Buffer, and 240µl deionized H2O, followed by addition thereto of 5µl
of Hind III (BRL, Cat. # 5207 SA, 10U/µl). This admixture was
incubated for 2hrs at 37°C. Thereafter, the admixture was analyzed by
agarose electrophoresis (1% agarose gel) to determine if the vector
had been successfully cut; a single band indicated that the cut had
been successful.

To the 305µl admixture was added 300µl of Reagent A. This admixture
was then centrifuged for 5min at 12,000RPM on a Beckman
microcentrifuge, followed by retrieval of the upper liquid layer. To
this liquid layer was added 33µl of 3M NaOAc (pH 4.8) and 660µl of
ethanol, followed by precipitation overnight at -20°C. This was
followed by centrifugation for 10min at 12,000RPM on a Beckman
microcentrifuge. The pellet was retrieved and dried by air. The dried
pellet was then resuspended in 150µl of deionized water.
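
The sodium acetate and ethanol volumes used in the precipitation steps of these Examples track the volume of the aqueous phase. The text gives only the numbers, not a rule; the sketch below therefore assumes a common rule of thumb (roughly one tenth volume of 3M NaOAc followed by two volumes of ethanol) purely for illustration, and the function name and default parameters are hypothetical.

```python
def precipitation_volumes(aqueous_ul, salt_fraction=0.11, ethanol_volumes=2.0):
    """Return (NaOAc ul, ethanol ul) for an ethanol precipitation.

    salt_fraction and ethanol_volumes are assumed rule-of-thumb values,
    not parameters stated in the text.
    """
    salt_ul = round(aqueous_ul * salt_fraction, 1)
    ethanol_ul = round((aqueous_ul + salt_ul) * ethanol_volumes, 1)
    return salt_ul, ethanol_ul


if __name__ == "__main__":
    # Example 4, BstE II digest: text uses 44 ul NaOAc and 888 ul ethanol
    print(precipitation_volumes(400))
    # Example 4, pooled Bal-31 aliquots: text uses 23 ul NaOAc and 506 ul ethanol
    print(precipitation_volumes(230, salt_fraction=0.10))
    # Hind III digest above: text uses 33 ul NaOAc and 660 ul ethanol
    print(precipitation_volumes(300))
```

Under these assumptions the first two cases match the stated volumes, while the third gives 666µl of ethanol against the 660µl stated, which suggests that the volumes in the text were rounded for convenience.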

In order to blunt (fill-in) one end of the Hind III cut vector, the
150µl solution comprising the resuspended pellet was admixed with 10µl
of 20X dNTP (2.5mM), 20µl of 10X MSB and 20µl of 100mM DTT. This was
followed by the addition of 4µl of Klenow polymerase (New England
Biolabs, Cat. No. 210, 5 U/ml) and incubation at room temperature for
7hrs. Thereafter, 300µl of Reagent A was added to the incubated
mixture, followed by centrifugation for 5min at 12,000RPM. The upper
liquid layer was retrieved and precipitated as above. The dried pellet
was then resuspended in 30µl of deionized H2O. For convenience, the
filled-in, Hind III cut vector is referred to as "vec.rb".

Thereafter, vec.rb was cut with Bam HI (BRL, Cat No. 5201 SA, 10
U/µl). To 30µl of vec.rb was added 30µl of 10X High Salt Buffer and
240µl of deionized H2O. To this admixture was added 5µl of Bam HI,
followed by incubation for 2hrs at 37°C. To the incubated mixture was
added 300µl of Reagent A, followed by centrifugation as above. The
upper liquid layer was retrieved and precipitated as above. The dried
pellet was then resuspended in 20µl of deionized H2O. The resuspended
pellet comprised Hind III cut, filled-in, Bam HI cut pBTac2 DNA.

The rSLO.3 was removed from the plasmid described above as follows. To
40µl of the plasmid comprising rSLO.3 (1µg/µl) was added 10X Sma I
Buffer, and 320µl deionized H2O. To this mixture was added 5µl Sma I
(10U/µl), followed by incubation at 37°C for 2hrs. In order to ensure
that the plasmid was cut, gel electrophoresis (1% agarose gel) was
conducted; this resulted in a single band, indicating a successful
cut. To the cut plasmid was added 8µl of 5M NaCl, followed by 5µl of
Bam HI (10 U/µl). This mixture was incubated at 37°C for 2hrs. To
ensure that the rSLO.3 sequence was successfully cut from the
approximately 6.3Kb pA33 vector, gel electrophoresis (1% agarose gel)
was conducted. This resulted in two bands, one at about 6.3Kb (the
vector), and the other at about 1.4Kb (rSLO.3). The 1.4Kb band was cut
from the gel and purified in 20µl of deionized H2O such that mSLO.3/6
was available for ligation in the prepared pBTac2 vector.

To 3µl of the vector was added 2µl of mSLO.3/6, 1.5µl of 10X Ligation
Buffer (0.66M TRIS-Cl (pH 7.5), 50mM MgCl2, 50mM DTT, 10mM ATP), 1.5µl
of 10mM ATP and 7µl of deionized H2O. Thereafter, 1.5µl of T4 Ligase
was added thereto, followed by incubation overnight at room
temperature. For convenience, this mixture is referred to as the
"subclone".

E. coli strain JM105 was transformed with the subclone as follows. A
vial containing 300µl of frozen JM105 competent cells was thawed, and
8.0µl of the subclone was added thereto. This admixture was incubated
on ice for 30min, followed by heat shock in a 37°C water bath for
2min. Thereafter, the transformed JM105 solution was added to 2ml of
LB medium (10g Bacto-tryptone; 5g Bacto yeast extract; 10g NaCl; 1
liter deionized water; pH 7.5 with sodium hydroxide), followed by
shaking (200RPM) for 30min at 37°C. Plating was thereafter
accomplished on LB Ampicillin plates, followed by growth overnight at
37°C. Screening was accomplished using the blood overlay method
disclosed above, and colonies evidencing hemolysis were selected.

Selected screened colonies comprising rSLO.3 subclones were inoculated
in 12ml of Superbroth-ampicillin broth. Induction was accomplished by
the addition of isopropyl-β-D-thiogalactopyranoside ("IPTG"), at a
final concentration of 1mM, to the culture broth when the culture
broth had an OD600 reading of 0.7. 12ml of the resulting solution was
centrifuged at 8000RPM for 10min at 4°C and the resulting pellet
resuspended in 1.2ml of PBS/10mM DTT. The resuspended pellet was
sonicated for 1.5min; the protein concentration of the sonicated
extract was determined using the BioRad Protein Assay protocol
described above. The protein concentration was determined to be
9.3mg/ml. This data was used to determine the specific hemolytic
activity of the sonicated extract by titer based upon the 50% lysis of
2.5% washed rabbit red blood cell protocol described above. The
hemolytic activity based upon titer of the culture comprising rSLO.3
was determined to be 2.69 x 10^3.

The foregoing Examples are directed to the generation of an SLO
genomic library. As those in the art appreciate, another type of
library which is much less complex than a genomic DNA library is a
"complementary DNA", or "cDNA", library. cDNA is derived directly from
mRNA; therefore, by definition, the cDNA library is comprised of
regions of translation. Methods for deriving cDNA libraries based upon
mRNA complementary to mSLO DNA are considered to be within the purview
of the skilled artisan such that cDNA-based libraries for mSLO are a
part of this disclosure.

PROTEIN FOLDING

As the linear arrangement of nucleotides defines a specific codon, the
arrangement, or sequence, of amino acids defines the protein,
including the particular function thereof. However, while the
particular amino acid sequence is important with respect to the
identity of the protein, the particular three dimensional shape that
the protein exhibits is of similar import. Such specificity in terms
of shape, in essence, co-defines the properties of the protein because
the shape of the protein enables the protein to specifically interact
with other molecules that will only recognize that particular protein
shape.

Most proteins spontaneously fold into their correct shape. By treating
the protein with certain denaturing solvents, the protein can "unfold"
into a flexible chain. When the denaturing agent is removed, portions
of the flexible chain may refold into their original conformation.
This is because one of the most important factors governing the
folding of a protein is the distribution of polar (hydrophilic, or
"water-loving") and non-polar (hydrophobic, or "water-hating") side
chains of the amino acids of that protein. Denaturing solvents
interfere with the polarity of the amino acid side chains. The
following amino acids have polar side chains: Asn; Gln; Ser; Thr; and
Tyr. The following amino acids have non-polar side chains: Gly; Ala;
Val; Leu; Ile; Pro; Phe; Met; Trp; and Cys. Amino acids with basic and
acidic side chains are very polar. The following amino acids have
basic side chains: Lys; Arg; and His. The following amino acids have
acidic side chains: Asp and Glu.

The environment in which proteins naturally exist is, by definition, a
non-denaturing environment, which is most typically aqueous.
Accordingly, the hydrophobic side chains of a protein tend to be
pushed together in the interior of the protein molecule, which enables
these to avoid contact with the aqueous environment. Polar side
chains, on the other hand, tend to arrange themselves near the outside
of the protein molecule, where they can interact with water and other
polar molecules.
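
The side-chain classes listed above can be captured in a small lookup table, for example to tally how many residues of each class a sequence contains. The following is only an illustrative sketch: it uses the standard three-letter amino acid abbreviations, follows the grouping given in the text, and the function names are hypothetical.

```python
from collections import Counter

# Side-chain classes as grouped in the text (standard three-letter codes).
SIDE_CHAIN_CLASS = {
    "polar": ["Asn", "Gln", "Ser", "Thr", "Tyr"],
    "non-polar": ["Gly", "Ala", "Val", "Leu", "Ile",
                  "Pro", "Phe", "Met", "Trp", "Cys"],
    "basic": ["Lys", "Arg", "His"],
    "acidic": ["Asp", "Glu"],
}

# Reverse lookup: residue -> class name.
CLASS_OF = {res: cls for cls, members in SIDE_CHAIN_CLASS.items() for res in members}


def classify(residue):
    """Return the side-chain class of a three-letter residue code."""
    return CLASS_OF[residue]


def class_composition(residues):
    """Count how many residues of each side-chain class a sequence contains."""
    return dict(Counter(classify(r) for r in residues))


if __name__ == "__main__":
    peptide = ["Met", "Lys", "Asp", "Ser", "Leu"]  # arbitrary example sequence
    print(class_composition(peptide))
```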

While the molecular mechanisms by which a linear DNA sequence is
transcribed and translated into a precise amino acid sequence of the
corresponding polypeptide are well understood, exactly how the
polypeptide chain folds simultaneously and autonomously into its
three-dimensional structure is not clearly understood. However, the
real potential of synthetic DNA, i.e. DNA synthesized via recombinant
techniques, will be realized in the area of protein design. In order
for this to be realized, however, the mechanism of protein folding
will have to be more succinctly clarified. While the general problem
of predicting protein structure from the sequence is elusive
(principally because no rules have emerged that allow structure to be
related to sequence), it is clear that certain portions of the
sequence are important to the structure and other portions are
relatively unimportant from a structural point of view such that
substitutions or modifications can be made at these portions.
Accordingly, it is assumed that portions of the sequence of a protein
contribute significantly to the stability of the folded protein
structure.

While predicting a protein structure from the protein sequence is
elusive, proteins, by definition, have unique three-dimensional
structures which can be determined. The following methodologies, for
example, can be used in the determination of protein structure:
Crystallography; Optical Activity; Nuclear Magnetic Resonance
Spectroscopy.

a) Crystallography

Proteins are capable of forming crystals. Proteins usually crystallize
in a condition of saturation or super-saturation which can be achieved
by altering one or more of a number of variables that affect the
solubility of the proteins. Thus, by altering the ionic strength of
the solution or by utilization of organic polymers, e.g., polyethylene
glycol, proteins can be crystallized. Techniques for growing protein
crystals are set forth in Narang, S.A. Protein Engineering: Approaches
to the Manipulation of Protein Folding (Butterworth, Publisher,
Stoneham MA., 1990), Chpt. 6 (hereinafter "Narang"). The preceding
text book is incorporated herein by reference in its entirety. Having
crystallized the protein, the techniques of x-ray, neutron, and
electron diffraction can be used to determine the structure of the
protein, with x-ray diffraction being preferred. The protein structure
in the crystal is assumed to be at or near the minimum conformational
free energy of the molecule for the crystal form.

WO 93/05156
PCT/US92/06398

b) Optical Activity

The optical activity of polypeptides/proteins, due to the
asymmetric centers of the amino acids and to the
asymmetric conformations thereof, can be utilized to
determine the structure of polypeptides/proteins. This
asymmetry causes proteins to interact differently with
right- and left-circularly polarized light; if the two
beams consequently travel at different speeds through the
protein, polarized light is rotated. Optical rotatory
dispersion ("ORD") is the dependence of this rotation
upon wavelength. In a wavelength region where the
protein molecule does not absorb light, the rotation
varies gradually with wavelength, but in an absorbance
region, the rotation first increases sharply in one
direction, falls to zero at the absorption maximum, and
then rises sharply in the opposite direction. There
will also be unequal absorption of left- and
right-circularly polarized light; this is referred to as
circular dichroism ("CD"). Both CD and ORD spectra of a
protein are very sensitive to the structural conformation
thereof. Folded proteins generally have significant
optical activity in the near-UV region (250-300 nm).
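
The two effects described in this section can be restated
compactly in standard textbook notation. The following is
only a sketch: the symbols are introduced here for
illustration and are not part of the original text (n_L and
n_R are the refractive indices, and ε_L and ε_R the molar
absorptivities, for left- and right-circularly polarized
light; l is the path length, c the concentration, and λ the
wavelength).

```latex
% Optical rotation (in radians) arising from unequal propagation speeds;
% its dependence on wavelength is the ORD curve.
\phi(\lambda) = \frac{\pi\, l}{\lambda}\,\bigl(n_L(\lambda) - n_R(\lambda)\bigr)

% Circular dichroism: unequal absorption of the two circular polarizations.
\Delta A(\lambda) = A_L(\lambda) - A_R(\lambda)
                  = \bigl(\varepsilon_L(\lambda) - \varepsilon_R(\lambda)\bigr)\, c\, l
```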

c) Nuclear Magnetic Resonance Spectroscopy

Nuclear Magnetic Resonance ("NMR") Spectroscopy, using,
e.g., 1H, 13C, 15N, 31P or 2H, has proven to be of great
use in studying protein structure in solution. Focusing
on 1H, each hydrogen atom in a molecule has a nuclear
magnetic spin, i.e. the nucleus of the atom acts like a
tiny magnet. In the absence of an external magnetic
field, the magnetic moments of the protons are randomly
oriented. In a Nuclear Magnetic Resonance experiment, a
strong external magnetic field is applied to the sample
along a specified direction, resulting in a net alignment
of the magnetic moments and a net macroscopic
magnetization along the specified directional axis; a
short radio-frequency pulse of appropriate strength is
then applied, knocking the magnetization vector away from
this axis. As the magnetization recovers, a transient
radio-frequency signal is recorded as a function of time.
A Fourier transform of this signal then yields a
frequency spectrum. Each proton in the molecule gives
rise to a peak in this spectrum occurring at some
characteristic resonance frequency determined by the
local electronic environment of that proton. The
resonance frequency of a particular proton is called its
"chemical shift" and is measured as an offset from some
reference frequency. Structural information from NMR is
derived from the nuclear Overhauser effect ("NOE", which
determines whether a pair of protons are near each other
in space) and the coupling constants of protons that are
separated by three or fewer chemical bonds. NOE and
coupling constants provide one-dimensional data;
two-dimensional data is provided by, inter alia, nuclear
Overhauser enhancement spectroscopy (NOESY) and
two-dimensional correlation spectroscopy (COSY); and from
such data, three-dimensional protein structures can be
determined.
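
The step from the recorded time-domain signal to a
frequency spectrum can be illustrated with a small
numerical sketch. It is not taken from the original
disclosure: the two resonance offsets (50 Hz and 120 Hz),
the decay constant, the sampling interval and all function
names are hypothetical values chosen for illustration, and
a naive discrete Fourier transform stands in for whatever
transform a real spectrometer would apply.

```python
import cmath
import math


def synth_fid(freqs_hz, t2_s, dwell_s, n_points):
    """Simulated transient: a sum of decaying complex sinusoids, one per proton."""
    fid = []
    for k in range(n_points):
        t = k * dwell_s
        decay = math.exp(-t / t2_s)
        fid.append(sum(decay * cmath.exp(2j * math.pi * f * t) for f in freqs_hz))
    return fid


def dft_magnitude(signal):
    """Naive O(n^2) discrete Fourier transform; returns the magnitude of each bin."""
    n = len(signal)
    mags = []
    for m in range(n):
        acc = 0j
        for k, x in enumerate(signal):
            acc += x * cmath.exp(-2j * math.pi * m * k / n)
        mags.append(abs(acc))
    return mags


def peak_frequencies(mags, dwell_s, n_peaks):
    """Frequencies (Hz) of the n_peaks largest local maxima of the spectrum."""
    n = len(mags)
    maxima = [m for m in range(n)
              if mags[m] > mags[m - 1] and mags[m] > mags[(m + 1) % n]]
    top = sorted(maxima, key=lambda m: mags[m], reverse=True)[:n_peaks]
    # Bin m corresponds to m / (n * dwell) Hz; only positive offsets are used here.
    return sorted(m / (n * dwell_s) for m in top)


# Hypothetical sample: two protons resonating 50 Hz and 120 Hz from the reference.
DWELL = 1.0 / 512  # sampling interval in seconds (assumed)
fid = synth_fid([50.0, 120.0], t2_s=0.2, dwell_s=DWELL, n_points=256)
spectrum = dft_magnitude(fid)
peaks = peak_frequencies(spectrum, DWELL, n_peaks=2)
print(peaks)
```

Each simulated proton contributes one peak, and the
position of that peak relative to the reference frequency
plays the role of the chemical shift in this toy example.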

In view of the foregoing information set forth with
respect to determination of the three-dimensional
structure of protein molecules, the following claims
directed to DNA macromolecules and amino acids inherently
include the three-dimensional structures associated with
the protein molecules expressed thereby.

The Examples herein are not to be construed as limited
to the specific vectors, plasmids and host cells which
are preferred. The rSLO described herein is not to be
construed as limited solely to the preferred rSLO
designated rSLO.3, or to the preferred vectors, plasmids
and host cells. Similarly, the preferred rSLO.3 in no
way constitutes an admission, either actual or implied,
that the DNA and amino acid sequences thereof are the
only DNA and amino acid sequences to which Applicants are
entitled. Applicants are entitled to the full breadth of
protection under applicable patent laws.

For purposes of claiming materials by designation, AR120
transformed with plasmids comprising pA33 - rSLO.3 and
JM105 transformed with plasmids comprising
pBTac2 DNA - rSLO.3 were deposited on August 23, 1991
with the American Type Culture Collection (ATCC), 12301
Parklawn Drive, Rockville, Maryland, 20852, under the
provisions of the Budapest Treaty for the International
Recognition of the Deposit of Microorganisms for the
Purpose of Patent Procedure. These were tested by the
ATCC on August 27, 1991, and both were determined to be
viable. The ATCC has assigned the deposit numbers ATCC
68675 and ATCC 68677, respectively, to these materials.

Based upon the disclosure herein, those skilled in the
art can readily obtain fragments of the DNA sequence set
forth in Figure 1 such that the fragment continues to
maintain at least one epitopic site characteristic of
wild-type SLO and maintains hemolytic characteristics.
Furthermore, and as noted, conservative substitutions of
nucleotides can be made without concomitant changes in
the amino acid sequence, as those in the art understand
and appreciate. For example, "computerized back
translation" techniques can be used, whereby the amino
acid sequence is analyzed by a computer and the computer
determines the optimum nucleotides to utilize in the
codons necessary to encode such amino acids.
Additionally, as the DNA synthesis art progresses such
that oligonucleotides having the length of the DNA
sequence of Figure 1 can be rapidly obtained, one can
synthesize that sequence as appropriate with such
advances in the art.
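
The idea of back translation described above can be
sketched in a few lines. This is a minimal illustration
under stated assumptions, not the procedure used by
Applicants: the PREFERRED codon table is hypothetical (a
real table would be derived from codon-usage data for the
chosen host), the function names are invented for this
example, and only the first sixteen codons of the Figure 1
sequence are used as input.

```python
# Standard genetic code, built with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Hypothetical "optimum" codon for each amino acid (assumption for illustration).
PREFERRED = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
}


def translate(dna):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def back_translate(protein):
    """Choose one codon per amino acid from the PREFERRED table."""
    return "".join(PREFERRED[aa] for aa in protein)


# First sixteen codons of the sequence set forth in Figure 1.
NATIVE = ("ATG GAT CCG TCA GAA GAC AAA AAA AAG AGC "
          "GAA GAA GAT CAC ACT GAA").replace(" ", "")

protein = translate(NATIVE)
redesigned = back_translate(protein)

print(protein)
print(redesigned)
print(translate(redesigned) == protein)
```

The redesigned DNA differs from the native fragment at
several nucleotide positions yet encodes the same amino
acids, which is the sense in which nucleotide
substitutions can be made without changing the amino acid
sequence.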

Because the screening of SLO derivatives can be readily
accomplished using the above-described blood overlay
technique, numerous SLO derivative candidates can be
rapidly evaluated. Accordingly, those skilled in the art
can readily use this technique to derive SLO derivative
analog candidates, rapidly screen these candidates for
indications of hemolytic activity, and determine the
nucleic acid and amino acid sequences of analogs.

Accordingly, while the Examples herein are directed to a
specific SLO variant, rSLO.3, and because those in the
art, having had this advance in the art placed in their
possession, can utilize techniques known to the art to
adapt this advance to their own ends, Applicant's
invention is seen to comprehend SLO derivatives having
the characteristics as defined, and is not limited to the
specific derivative disclosed in the Examples.

Although the present invention has been described in
considerable detail with regard to certain preferred
embodiments thereof, other embodiments within the scope
of the teachings of the present invention are possible.
As such, while the production of a specific SLO
derivative has been described in detail, this is to be
construed as an exemplar. Accordingly, neither the
disclosure, nor the claims to follow, are intended, nor
should be construed to be, limited by the descriptions of
the preferred embodiments contained herein.

37 C.F.R. §1.821(c) SEQUENCE LISTING



(1) GENERAL INFORMATION

(i) APPLICANT: Adams, Craig W., and Wang, Eva Y.

(ii) TITLE OF INVENTION: Streptolysin O Derivatives

(iii) NUMBER OF SEQUENCES: 2

(iv) CORRESPONDENCE ADDRESS:
    (A) ADDRESSEE: Beckman Instruments, Inc.
    (B) STREET: 2500 Harbor Blvd.
    (C) CITY: Fullerton
    (D) STATE: California
    (E) COUNTRY: USA
    (F) ZIP: 92634

(v) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb
    (B) COMPUTER: IBM
    (C) OPERATING SYSTEM: MS-DOS
    (D) SOFTWARE: WordPerfect 5.1

(vi) CURRENT APPLICATION DATA:
    (A) APPLICATION NUMBER: (Herewith)
    (B) FILING DATE: (Herewith)
    (C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA: (Not Applicable)
    (A) APPLICATION NUMBER:
    (B) FILING DATE:

(viii) ATTORNEY/AGENT INFORMATION:
    (A) NAME: Burgoon, Richard P.
    (B) REGISTRATION NUMBER: 34,787
    (C) REFERENCE/DOCKET NUMBER: 128D-1023

(ix) TELECOMMUNICATION INFORMATION:
    (A) TELEPHONE: (714) 773-7610
    (B) TELEFAX: (714) 773-7936

(2) INFORMATION FOR SEQ ID NO.: 1

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 1524 base pairs
    (B) TYPE: Nucleic acid
    (C) STRANDEDNESS: double
    (D) TOPOLOGY: linear

(ii) MOLECULAR TYPE: genomic DNA

(vi) ORIGINAL SOURCE:
    (A) ORGANISM: Streptococcus pyogenes

(vii) IMMEDIATE SOURCE:
    (A) LIBRARY: genomic
    (B) CLONE: rSLO.3

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 1

ATG GAT CCG TCA GAA GAC AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 48
GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 96
CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 144 
GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 192 

ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 240 

AGG ACC TAT CCA GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288 

AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 336 

ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 384 

CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 432 

TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 480 

TAT ACT GAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 528 

AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 576 

TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 624 

ATT TTT TAC ACC GTA TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 

TAC TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 

GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AAA GAC TTT GAT GTT 960
ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTC AGT AGA AAA AAC CCA 1008
GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104 

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 1248 

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296 

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344 

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392 

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440 

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488 

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524 

(3) INFORMATION FOR SEQ ID NO.: 2

(i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 480 amino acids
    (B) TYPE: Amino acid
    (C) TOPOLOGY: linear

(ii) MOLECULAR TYPE: protein

(ix) FEATURE:
    (A) NAME/KEY: Signal sequence
    (B) LOCATION: amino acid 98 to amino acid 571 of SLO
    (C) IDENTIFICATION METHOD: experimentally determined based
        upon production of soluble, hemolytically active SLO
        from recombinant vector
    (D) OTHER INFORMATION: Lyses red blood cells
(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 2

Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu
-2 -1 5 10

Glu Ile Asn Asp Lys Ile Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val
15 20 25 30

Leu Ala Lys Asn Gly Glu Thr Ile Glu Asn Phe Val Pro Lys Glu Gly
35 40 45

Val Lys Lys Ala Asp Lys Phe Ile Val Ile Glu Arg Lys Lys Lys Asn
50 55 60

Ile Asn Thr Thr Pro Val Asp Ile Ser Ile Ile Asp Ser Val Thr Asp
65 70 75

Arg Thr Tyr Pro Ala Ala Leu Gln Leu Ala Asn Lys Gly Phe Thr Glu
80 85 90

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gln Lys Ile His
95 100 105 110

Ile Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp
115 120 125

Pro Thr Tyr Ala Asn Val Ser Thr Ala Ile Asp Asn Leu Val Asn Gln
130 135 140

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gln
145 150 155

Tyr Thr Glu Ser Met Val Tyr Ser Lys Ser Gln Ile Glu Ala Ala Leu
160 165 170

Asn Val Asn Ser Lys Ile Leu Asp Gly Thr Leu Gly Ile Asp Phe Lys
175 180 185 190

Ser Ile Ser Lys Gly Glu Lys Lys Val Met Ile Ala Ala Tyr Lys Gln
195 200 205

Ile Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val
210 215 220

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gln Arg Lys Gly Val Ser
225 230 235

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr
240 245 250

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala
255 260 265 270

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys
275 280 285

Tyr Ser Asp Ile Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly
290 295 300

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Lys Asp Phe Asp Val
305 310 315

Ile Arg Asn Val Ile Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro
320 325 330

Ala Tyr Pro Ile Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys Ile
335 340 345 350

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu
355 360 365

Tyr Thr Ser Gly Lys Ile Asn Leu Ser His Gln Gly Ala Tyr Val Ala
370 375 380

Gln Tyr Glu Ile Leu Trp Asp Glu Ile Asn Tyr Asp Asp Lys Gly Lys
385 390 395

Glu Val Ile Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr
400 405 410

Ser Pro Phe Ser Thr Val Ile Pro Leu Gly Ala Asn Ser Arg Asn Ile
415 420 425 430

Arg Ile Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg
435 440 445

Lys Val Ile Asp Glu Arg Asp Val Lys Leu Ser Lys Glu Ile Asn Val
450 455 460

Asn Ile Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser Ile Thr Tyr Lys
465 470 475
What is claimed is:

1. A purified and isolated DNA sequence consisting
essentially of a DNA sequence encoding a soluble
derivative of Streptolysin O evidencing hemolytic
activity.

2. A DNA sequence encoding a soluble derivative of
Streptolysin O, wherein said DNA sequence is as set forth
in Figure 1.

3. A prokaryotic or eukaryotic host cell transformed or
transfected with a DNA sequence according to claim 1 or 2
in a manner allowing the host cell to express
Streptolysin O.

4. A DNA plasmid comprising a DNA vector and a DNA
sequence according to claim 1 or 2.

5. A prokaryotic or eukaryotic host cell transformed or
transfected with a DNA plasmid according to claim 4.

6. A purified and isolated DNA sequence consisting
essentially of a DNA sequence encoding a soluble
polypeptide having an amino acid sequence of that of a
derivative of Streptolysin O.

7. An amino acid sequence encoding a soluble variant of
Streptolysin O wherein said amino acid sequence is as set
forth in Figure 2.

8. A derivative of Streptolysin O having substantially
the same characteristics as the material capable of being
expressed by the material having ATCC deposit number ATCC
68675.

9. A derivative of Streptolysin O having substantially
the same characteristics as the material capable of being
expressed by the material having ATCC deposit number ATCC
68677.

ATG GAT CCG TCA GAA GAC AAA AAA AAG AGC GAA GAA GAT CAC ACT GAA 48
GAA ATC AAT GAC AAG ATT TAT TCA CTA AAT TAT AAT GAG CTT GAA GTA 96
CTT GCT AAA AAT GGT GAA ACC ATT GAA AAT TTT GTT CCT AAA GAA GGC 144 
GTT AAG AAA GCT GAT AAA TTT ATT GTC ATT GAA AGA AAG AAA AAA AAT 192 
ATC AAC ACT ACA CCA GTC GAT ATT TCC ATC ATT GAC TCT GTC ACT GAT 240 
AGG ACC TAT CCA GCA GCC CTT CAG CTG GCT AAT AAA GGT TTT ACC GAA 288
AAC AAA CCA GAC GCG GTA GTC ACC AAG CGA AAC CCA CAA AAA ATC CAT 336 
ATT GAT TTA CCA GGT ATG GGA GAC AAA GCA ACG GTT GAG GTC AAT GAC 384
CCT ACC TAT GCC AAT GTT TCA ACA GCT ATT GAT AAT CTT GTT AAC CAA 432 
TGG CAT GAT AAT TAT TCT GGT GGT AAT ACG CTT CCT GCC AGA ACA CAA 480 
TAT ACT GAA TCA ATG GTA TAT TCT AAG TCA CAG ATT GAA GCA GCT CTA 528 
AAT GTT AAT AGC AAA ATC TTA GAT GGT ACT TTA GGC ATT GAT TTC AAG 576 
TCG ATT TCA AAA GGT GAA AAG AAG GTG ATG ATT GCA GCA TAC AAG CAA 624

ATT TTT TAC ACC GTA TCA GCA AAC CTT CCT AAT AAT CCT GCG GAT GTG 672 

TTT GAT AAA TCA GTG ACC TTT AAA GAG TTG CAA CGA AAA GGT GTC AGC 720 

AAT GAA GCT CCG CCA CTC TTT GTG AGT AAC GTA GCC TAT GGT CGA ACT 768 

GTT TTT GTC AAA CTA GAA ACA AGT TCT AAA AGT AAT GAT GTT GAA GCG 816 

GCC TTT AGT GCA GCT CTA AAA GGA ACA GAT GTT AAA ACT AAT GGA AAA 864 

TAC TCT GAT ATC TTA GAA AAT AGC TCA TTT ACA GCT GTC GTT TTA GGA 912 

GGA GAT GCT GCA GAG CAC AAT AAG GTA GTC ACA AAA GAC TTT GAT GTT 960 

ATT AGA AAC GTT ATC AAA GAC AAT GCT ACC TTC AGT AGA AAA AAC CCA 1008

GCT TAT CCT ATT TCA TAC ACC AGT GTT TTC CTT AAA AAT AAT AAA ATT 1056 

GCG GGT GTC AAT AAC AGA ACT GAA TAC GTT GAA ACA ACA TCT ACC GAG 1104

TAC ACT AGT GGA AAA ATT AAC CTG TCT CAT CAA GGC GCG TAT GTT GCT 1152 

CAA TAT GAA ATC CTT TGG GAT GAA ATC AAT TAT GAT GAC AAA GGA AAA 1200 

GAA GTG ATT ACA AAA CGA CGT TGG GAT AAC AAC TGG TAT AGT AAG ACA 1248

TCA CCA TTT AGC ACA GTT ATC CCA CTA GGA GCT AAT TCA CGA AAT ATA 1296

CGT ATC ATG GCT AGA GAG TGC ACC GGC TTA GCT TGG GAA TGG TGG CGA 1344

AAA GTG ATC GAC GAA AGA GAT GTG AAA CTG TCT AAA GAA ATC AAT GTC 1392

AAC ATC TCA GGA TCA ACC CTG AGC CCA TAT GGT TCG ATT ACT TAT AAG 1440

TAG GAC TGG TTC AAG AGG TTC GTC AAG CAC CTT GAT GCT GCT TAT CTC 1488

TTG AGA TCC CCG GGT AGG CCT AGT TAA CTA GTC GAC 1524

Met Asp Pro Ser Glu Asp Lys Lys Lys Ser Glu Glu Asp His Thr Glu
-2 -1 5 10

Glu Ile Asn Asp Lys Ile Tyr Ser Leu Asn Tyr Asn Glu Leu Glu Val
15 20 25 30

Leu Ala Lys Asn Gly Glu Thr Ile Glu Asn Phe Val Pro Lys Glu Gly
35 40 45

Val Lys Lys Ala Asp Lys Phe Ile Val Ile Glu Arg Lys Lys Lys Asn
50 55 60

Ile Asn Thr Thr Pro Val Asp Ile Ser Ile Ile Asp Ser Val Thr Asp
65 70 75

Arg Thr Tyr Pro Ala Ala Leu Gln Leu Ala Asn Lys Gly Phe Thr Glu
80 85 90

Asn Lys Pro Asp Ala Val Val Thr Lys Arg Asn Pro Gln Lys Ile His
95 100 105 110

Ile Asp Leu Pro Gly Met Gly Asp Lys Ala Thr Val Glu Val Asn Asp
115 120 125

Pro Thr Tyr Ala Asn Val Ser Thr Ala Ile Asp Asn Leu Val Asn Gln
130 135 140

Trp His Asp Asn Tyr Ser Gly Gly Asn Thr Leu Pro Ala Arg Thr Gln
145 150 155

Tyr Thr Glu Ser Met Val Tyr Ser Lys Ser Gln Ile Glu Ala Ala Leu
160 165 170

Asn Val Asn Ser Lys Ile Leu Asp Gly Thr Leu Gly Ile Asp Phe Lys
175 180 185 190

Ser Ile Ser Lys Gly Glu Lys Lys Val Met Ile Ala Ala Tyr Lys Gln
195 200 205

Ile Phe Tyr Thr Val Ser Ala Asn Leu Pro Asn Asn Pro Ala Asp Val
210 215 220

Phe Asp Lys Ser Val Thr Phe Lys Glu Leu Gln Arg Lys Gly Val Ser
225 230 235

Asn Glu Ala Pro Pro Leu Phe Val Ser Asn Val Ala Tyr Gly Arg Thr
240 245 250

Val Phe Val Lys Leu Glu Thr Ser Ser Lys Ser Asn Asp Val Glu Ala
255 260 265 270

Ala Phe Ser Ala Ala Leu Lys Gly Thr Asp Val Lys Thr Asn Gly Lys
275 280 285

Tyr Ser Asp Ile Leu Glu Asn Ser Ser Phe Thr Ala Val Val Leu Gly
290 295 300

Gly Asp Ala Ala Glu His Asn Lys Val Val Thr Lys Asp Phe Asp Val
305 310 315

Ile Arg Asn Val Ile Lys Asp Asn Ala Thr Phe Ser Arg Lys Asn Pro
320 325 330

Ala Tyr Pro Ile Ser Tyr Thr Ser Val Phe Leu Lys Asn Asn Lys Ile
335 340 345 350

Ala Gly Val Asn Asn Arg Thr Glu Tyr Val Glu Thr Thr Ser Thr Glu
355 360 365

Tyr Thr Ser Gly Lys Ile Asn Leu Ser His Gln Gly Ala Tyr Val Ala
370 375 380

Gln Tyr Glu Ile Leu Trp Asp Glu Ile Asn Tyr Asp Asp Lys Gly Lys
385 390 395

Glu Val Ile Thr Lys Arg Arg Trp Asp Asn Asn Trp Tyr Ser Lys Thr
400 405 410

Ser Pro Phe Ser Thr Val Ile Pro Leu Gly Ala Asn Ser Arg Asn Ile
415 420 425 430

Arg Ile Met Ala Arg Glu Cys Thr Gly Leu Ala Trp Glu Trp Trp Arg
435 440 445

Lys Val Ile Asp Glu Arg Asp Val Lys Leu Ser Lys Glu Ile Asn Val
450 455 460

Asn Ile Ser Gly Ser Thr Leu Ser Pro Tyr Gly Ser Ile Thr Tyr Lys
465 470 475